A Monitoring System for the BaBar INFN Computing Cluster
Abstract
Monitoring large clusters is a challenging problem. It is necessary to observe a large number of devices with a reasonably short delay between consecutive observations. The set of monitored devices may include PCs, network switches, tape libraries and other equipment. The monitoring activity should not impact the performance of the system. In this paper we present PerfMC, a monitoring system for large clusters. PerfMC is driven by an XML configuration file and uses the Simple Network Management Protocol (SNMP) for data collection. SNMP is a standard protocol implemented by many networked devices, so the tool can be used to monitor a wide range of equipment. System administrators can display information on the status of each device by connecting to a web server embedded in PerfMC. The web server can produce graphs showing the value of different monitored quantities as a function of time; it can also produce arbitrary XML pages by applying XSL Transformations to an internal XML representation of the cluster’s status. XSL Transformations may be used to produce HTML pages which can be displayed by ordinary web browsers. PerfMC aims to be relatively easy to configure and operate, and highly efficient. It is currently being used to monitor the Italian Reprocessing farm for the BaBar experiment, which consists of about 200 dual-CPU Linux machines.
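The abstract describes a tool driven by an XML configuration file that keeps an internal XML representation of the cluster's status. The following is a minimal sketch of that idea in Python, not PerfMC's actual code or schema: the element and attribute names (`cluster`, `host`, `oid`, `community`, `interval`, `status`, `sample`) and the helper functions are hypothetical. The SNMP query itself is left out, since it would need a third-party library, and so is the XSL Transformation step. The two OIDs are the standard MIB-II identifiers for `sysUpTime.0` and `ifInOctets` on interface 1.

```python
import xml.etree.ElementTree as ET

# Hypothetical configuration; element and attribute names are illustrative,
# not PerfMC's real schema.
CONFIG = """
<cluster name="farm">
  <host name="node001" community="public" interval="30">
    <oid label="sysUpTime">1.3.6.1.2.1.1.3.0</oid>
    <oid label="ifInOctets.1">1.3.6.1.2.1.2.2.1.10.1</oid>
  </host>
  <host name="switch01" community="public" interval="60">
    <oid label="sysUpTime">1.3.6.1.2.1.1.3.0</oid>
  </host>
</cluster>
"""


def load_targets(xml_text):
    """Return one (host, community, interval_s, label, oid) tuple per monitored quantity."""
    root = ET.fromstring(xml_text)
    targets = []
    for host in root.findall("host"):
        # Fall back to a 60-second polling interval if none is given (assumed default).
        interval = int(host.get("interval", "60"))
        for oid in host.findall("oid"):
            targets.append(
                (host.get("name"), host.get("community"), interval,
                 oid.get("label"), oid.text.strip())
            )
    return targets


def status_to_xml(cluster_name, samples):
    """Serialize {(host, label): value} samples into an XML status document."""
    root = ET.Element("status", cluster=cluster_name)
    for (host, label), value in sorted(samples.items()):
        sample = ET.SubElement(root, "sample", host=host, label=label)
        sample.text = str(value)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    for target in load_targets(CONFIG):
        print(target)
    # The value below is a placeholder standing in for a real SNMP reading.
    print(status_to_xml("farm", {("node001", "sysUpTime"): 12345}))
```

A status document of this kind is the sort of input an XSLT stylesheet could turn into an HTML page; in Python that step would need an XSLT-capable library, because the standard library does not provide one.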
Similar resources
Preliminary Characterization Tests of Detectors of on-Line Monitor Systems of the Italian National Center of Oncological Hadron-Therapy (CNAO)
Introduction Hadron-therapy is an effective technique used to treat tumors that are located between or nearby vital organs. The Italian National Center of Oncological Hadron-therapy (CNAO) has been realized as the first facility in Italy to treat very difficult tumors with protons and Carbon ions. The on-line monitor system for CNAO has been developed by the Department of Physics of the Univers...
Parallel computing using MPI and OpenMP on self-configured platform, UMZHPC.
Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains. In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on a set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...
New Physics Searches at BABAR. F. Renga, Università di Roma “La Sapienza” and INFN Roma (for the BABAR
We will present the most recent results from the BABAR Collaboration concerning New Physics searches in rare B and Lepton Flavour Violating (LFV) decays, including b → s transitions, purely leptonic B decays and LFV τ decays.
Design and Evaluation of a Pressure and Temperature Monitoring System for Pressure Ulcer Prevention
Introduction Pressure ulcers are tissue damages resulting from blood flow restriction, which occurs when the tissue is exposed to high pressure for a long period of time. These painful sores are common in patients and elderly, who spend extended periods of time in bed or wheelchair. In this study, a continuous pressure and temperature monitoring system was developed for pressure ulcer preventio...
Distributed Offline Data Reconstruction in BaBar
Anders Ryd California Institute of Technology, Pasadena, CA 91125 Alberto Crescente, Alvise Dorigo, Fulvio Galeazzi, Mauro Morandin, Roberto Stroili, Gionni Tiozzo, Gabriele Vedovato INFN Padova, I-35131 Padova, Italy Francesco Safai Tehrani INFN Rome, I-00185 Rome, Italy Teela Pulliam Ohio State University, Columbus, Ohio 43210 Peter Elmer Princeton University, Princeton, NJ 08544 Antonio Cese...
Journal title:
- CoRR
Volume cs.PF/0305054
Pages -
Publication date 2003